AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
High-Reward Strategy

# High-Reward Strategy

Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.
Physics Model
P
sofiascat
14
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.
Physics Model
P
sigalaz
20
0
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to control the safe landing of a lunar lander.
Physics Model
P
andri
16
0
Td3 Hopper V3
This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.
Physics Model
T
sb3
30
0
Ppo HalfCheetah V3
This is a reinforcement learning model based on the PPO algorithm, specifically designed for the HalfCheetah-v3 environment and trained using the stable-baselines3 library.
Physics Model
P
sb3
51
1
Dqn LunarLander V2
This is a DQN agent trained using the stable-baselines3 library to solve reinforcement learning tasks in the LunarLander-v2 environment.
D
araffin
54
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase